A Bio-inspired Clustering Approach for Dynamic Document Distributed Analysis

نویسندگان

  • Xiaohui Cui
  • Thomas E. Potok
چکیده

Document clustering is a fundamental operation used in unsupervised document organization, automatic topic extraction and information retrieval. But most clustering technologies are limited in their application on the static document collection. Intelligence analysts are currently overwhelmed with tremendous amount of text information streams generated everyday. There is a lack of comprehensive tool that can real-time analyze the dynamic changed information streams. In this paper, we propose a bio-inspired clustering model, the Multiple Species Flocking clustering model (MSFC), and present a distributed multi-agent MSFC approach for clustering dynamic updated text information streams. The decentralized architectures and communication schemes of the MSFC multi-agent distributed implementation for load balance and status information synchronization are also discussed in this article.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Distributed Agent Implementation of Multiple Species Flocking Model for Document Partitioning Clustering

The Flocking model, first proposed by Craig Reynolds, is one of the first bio-inspired computational collective behavior models that has many popular applications, such as animation. Our early research has resulted in a flock clustering algorithm that can achieve better performance than the Kmeans or the Ant clustering algorithms for data clustering. This algorithm generates a clustering of a g...

متن کامل

Hybrid Bio-Inspired Clustering Algorithm for Energy Efficient Wireless Sensor Networks

In order to achieve the sensing, communication and processing tasks of Wireless Sensor Networks, an energy-efficient routing protocol is required to manage the dissipated energy of the network and to minimalize the traffic and the overhead during the data transmission stages. Clustering is the most common technique to balance energy consumption amongst all sensor nodes throughout the network. I...

متن کامل

A General Bio-inspired Method to Improve the Short-Text Clustering Task

“Short-text clustering” is a very important research field due to the current tendency for people to use very short documents, e.g. blogs, text-messaging and others. In some recent works, new clustering algorithms have been proposed to deal with this difficult problem and novel bio-inspired methods have reported the best results in this area. In this work, a general bio-inspired method based on...

متن کامل

Dynamic Data Mining: Synergy of Bio-Inspired Clustering Methods

Dynamic data mining (DDM) comprises advantages of static methods used to reveal implicit structure of classes and at the same time benefits from high quality results obtained in the field of time series analysis. Clustering problem is recognized to be the most crucial in almost any knowledge domain: telecommunications and networking, nanotechnology, physics, chemistry, biology, health care, soc...

متن کامل

A P2P-based Flocking Algorithm for Distributed Clustering using Small World Structure

Clustering has become an increasingly important task in modern application domains such as electronic commerce, multimedia, surveillance using sensor networks as well as many others. In many of these areas, the data are originally collected at different sites and their transmission to a central site is almost impossible. This requires to develop novel distributed clustering algorithms to handle...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006